Two Principles and Six Techniques for Rapid MT Development
Abstract
In this paper we describe a range of techniques used at NMSU CRL for accelerating the development of MT systems. These techniques enable semi-automatic development of a number of components of a multilingual MT system, and thereby rapid deployment of MT capabilities in a new language. First, we describe the core multi-engine, multilingual architecture that allows the different techniques to be rapidly integrated to build an MT system. We show how off-the-shelf components were used in this architecture for fast development. Then we illustrate a set of techniques for semi-automatic acquisition of static resources: (a) automatic induction of grammars, (b) corpus-based acquisition of bilingual glossaries, and automatic acquisition of semantic lexicons through (c) lexical rules and (d) reversal of analysis lexicons to generation lexicons. Finally, we describe an automatic testing environment that enables rapid validation of automatically acquired resources.
1 Rapid Development Techniques
Static knowledge sources (grammars, lexicons, world knowledge bases) are the most time-consuming components to develop in any rule-based machine translation system. It is, therefore, imperative to find ways of speeding up the creation and updating of high-quality, useful static knowledge sources. It is equally imperative to rely on a robust and flexible core computational architecture that allows the concurrent manipulation of a large number of static and dynamic knowledge sources as well as documents and document collections. In this paper, we describe several techniques for facilitating rapid development of MT capabilities for a new language in the framework of an existing multilingual system.
Our approach is based on the following two principles:
• Heterogeneous, Multi-Engine, Multilingual Architecture: a multi-engine architecture, in which different subsets of MT techniques can be combined for different languages, accelerates development; perfecting any single prespecified MT method for a new language takes longer to deliver comparable initial capabilities.
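As a rough illustration of the multi-engine principle, the sketch below registers a different subset of translation engines for each source language and keeps the highest-scoring candidate. It is a minimal sketch under stated assumptions, not the system described in the paper: the engine names, the toy glossaries, the `Candidate` type, the language codes, and the confidence scores are all hypothetical.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional


@dataclass
class Candidate:
    engine: str   # name of the engine that produced the output (hypothetical)
    text: str     # candidate translation
    score: float  # hypothetical engine-reported confidence in [0, 1]


# An engine maps a source sentence to a candidate, or None if it has no output.
Engine = Callable[[str], Optional[Candidate]]


def glossary_engine(glossary: Dict[str, str]) -> Engine:
    """Toy word-for-word engine backed by a bilingual glossary."""
    def run(sentence: str) -> Optional[Candidate]:
        words = sentence.lower().split()
        hits = [w for w in words if w in glossary]
        if not hits:
            return None
        out = " ".join(glossary.get(w, w) for w in words)
        # Score is the fraction of source words covered by the glossary.
        return Candidate("glossary", out, len(hits) / len(words))
    return run


def phrase_engine(examples: Dict[str, str]) -> Engine:
    """Toy exact-match engine backed by stored sentence pairs."""
    def run(sentence: str) -> Optional[Candidate]:
        target = examples.get(sentence.lower())
        if target is None:
            return None
        return Candidate("phrase", target, 1.0)
    return run


def translate(sentence: str, engines: List[Engine]) -> Optional[Candidate]:
    """Run every engine configured for a language and keep the best candidate."""
    candidates = [c for c in (e(sentence) for e in engines) if c is not None]
    return max(candidates, key=lambda c: c.score, default=None)


# Each language gets its own subset of engines; a new language can start
# with whatever resources are available (here, only a small glossary).
ENGINES_BY_LANGUAGE: Dict[str, List[Engine]] = {
    "es": [
        glossary_engine({"casa": "house", "roja": "red"}),
        phrase_engine({"buenos días": "good morning"}),
    ],
    "xx": [glossary_engine({"foo": "bar"})],
}

if __name__ == "__main__":
    print(translate("buenos días", ENGINES_BY_LANGUAGE["es"]))
    print(translate("la casa roja", ENGINES_BY_LANGUAGE["es"]))
```

In this arrangement a new language needs only one working engine to produce output, and further engines can be added to its list later without changing the selection logic.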